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The search for the Higgs boson(s) is one of the major priorities at the up- 
graded Fermilab Tevatron and at the CERN Large Hadron Collider (LHC). 
Monte Carlo event generators (MCs) are heavily utilized to extract and inter- 
pret the Higgs signal, which depends on the details of the soft-gluon emission 
from the initial state partons in hadronic collisions. Thus, it is crucial to es- 
tablish the reliability of the MCs used by the experimentalists. In this paper, 
the MC based parton shower formalism is compared to that of an analytic re- 
summation calculation. Theoretical input, predictions and, where they exist, 
data for the transverse momentum distribution of Higgs bosons, Z° bosons, 
and photon pairs are compared for the Tevatron and the LHC. This com- 
parison is useful in understanding the strengths and the weaknesses of the 
different theoretical approaches, and in testing their reliability. 
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I. INTRODUCTION 



To reveal the dynamics of the electroweak symmetry breaking, a new generation of 
hadron colliders will search for the Higgs boson(s). The potential of the upgraded Fermilab 
Tevatron, the 2 TeV center of mass energy proton-antiproton collider starting operation 
within a year, was analysed in Ref . ]I| • Later in this decade, two experimental collaborations 
(ATLAS and CMS) join the search at the CERN Large Hadron Collider (LHC) with 14 TeV 
proton-proton collisions. An extraction of the Higgs signal at the LHC requires not only 
the precise knowledge of the signal and background invariant mass distributions, but also 
the accurate prediction of the corresponding transverse momentum (p?) distributions. In 
general, the determination of the signal requires a detailed event modeling, an understanding 
of the detector resolution, kinematical acceptance and efficiency, all of which depend on the 
Pt distribution. The shape of this distribution in the low to moderate px region, can dictate 
the details of both the experimental triggering and the analysis strategies for the Higgs 
search. It can also be used to devise an improved search strategy, and to enhance the 
statistical significance of the signal over the background |2]||. In the gg — > HX — > 77 AT 
mode at the LHC, for example, the shape of the signal and the background pt distribution 
of the photon pairs is different (c.f. Refs. |4]||), with the signal being harder. This difference 
can be utilized to increase the signal to background ratio. Furthermore, since vertex pointing 
with the photons is not possible in the CMS barrel, the shape of the pt distribution affects 
the precision of the determination of the event vertex from which the Higgs (decaying into 
two photons) originated. [] Thus, for a successful, high precision extraction of the Higgs 
signal, the theoretical calculation must be capable of reproducing the expected transverse 
momentum distribution. 



The vertex with the most activity is chosen as the vertex from which the Higgs particle has 
originated. If the Higgs is typically produced at a relatively high value of pt, then this choice is 
correct a large fraction of the time. 
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To reliably predict the px distribution of Higgs bosons at the LHC, especially for the 
low to medium px region where the bulk of the rate is, the effects of the multiple soft- 
gluon emission have to be included. One approach which achieves this is parton showering 
0. Parton shower Monte Carlo programs such as PYTHIA 0, HERWIG gj and ISA JET 
are commonly used by experimentalists, both as a way of comparing experimental data 
to theoretical predictions, and also as a means of simulating experimental signatures in 
kinematic regimes for which there are no experimental data yet (such as for the LHC). The 
final output of these Monte Carlo programs consists of the 4-momenta of a set of final state 
particles. This output can either be compared to reconstructed experimental quantities or, 
when coupled with a simulation of a detector response, can be directly compared to raw data 
taken by the experiment, and/or passed through the same reconstruction procedures as the 
raw data. In this way, the parton shower programs can be more useful to experimentalists 
than analytic calculations. Indeed, almost all of the physics plots in the ATLAS physics 
TDR ]I0[ involve comparisons to PYTHIA version 5.7. 

Predictions of the Higgs px can also be obtained utilizing an analytic resummation 
formalism, which sums contributions of o^Iy^^h/pt) (where itlh is the Higgs mass, and 
m < 2n — 1) up to all orders in the strong coupling as- In the recent literature, most 
calculations of this kind are either based on, or originate from, the low px factorization 
formalism [|TTJ (for the latest review see Ref. [0). This formalism resums the effects of 
the multiple soft-gluon emission while also systematically including the fixed order QCD 
corrections. It is possible to smoothly match the resummed result to the fixed order one in 
the intermediate to high px region, thus obtaining a prediction for the full px distribution 
13| . In this paper, we use this formalism as the analytic 'benchmark' to calculate the px 



distributions of Higgs bosons at the LHC, and of Z° bosons and photon pairs produced in 
hadron collisions. 

For many physical quantities, the predictions from parton shower Monte Carlo programs 
should be nearly as precise as those from analytic theoretical calculations. It is expected that 
both the Monte Carlo and analytic calculations should accurately describe the effects of the 



emission of multiple soft-gluons from the incoming partons, an all orders problem in QCD. 
The initial state soft-gluon emission affects the kinematics of the final state partons. This 
may have an impact on the signatures of physics processes at both the trigger and analysis 
levels and thus it is important to understand the reliability of such predictions. The best 
method for testing the reliability is a direct comparison of the predictions to experimental 
data. If no experimental data are available for certain predictions, then some understanding 
of the reliability may be gained from the comparison of the predictions from the two different 
methods. 

In the absence of experimental data for Higgs production, we can gauge the reliability of 
calculations for this process by comparing them to each other. We also compare predictions 
form the different formalisms to data for processes which are similar to Higgs production 
at the LHC In this way we can perform a genuine 'reality check' of the various theoretical 
predictions. Production of a light, neutral Higgs boson at the LHC in the standard model 
(SM) and its supersymmetric extensions proceeds via the partonic subprocess gg (through 
heavy fermion loop) — > HX. One of the major backgrounds for a light Higgs, in the mass 
range of 100 GeV ^ m H < 150 GeV, is diphoton production, a sizable contribution to which 
comes from the same, gg initial state. Since the major part of the soft-gluon radiation is 
initiated from the incoming partons, the structures of the resummed corrections are similar 
for Higgs boson and diphoton production. Because the latter is measurable at the Fermilab 
Tevatron, diphoton production provides an exceptional opportunity to test the different 
theoretical models. Z° boson production can also be a good testing ground for the soft- 
gluon corrections to Higgs production. The treatment of the fixed order and resummed 
QCD corrections for Z° boson production is theoretically well understood and implemented 
at next-to-next-to-leading order Furthermore, just as in the diphoton case, predictions 
can also be tested against Tevatron data. The Z° data have the advantage that sufficient 
statistics exist in the Run 1 data from CDF and DO to allow for detailed comparisons to the 
theoretical predictions. 
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II. LOW P T FACTORIZATION 



In this section the low transverse momentum factorization formalism and its matching 
to the usual factorization is reviewed. The problem arises as follows. When calculating 
fixed order QCD corrections to the px distribution of the inclusive process pp — > HX, the 
standard QCD factorization theorem is invoked 



da „ , x daj 1 j 2 



dp2 - fh/p( m H)® -^r{m H ,p T )® f h/p (m H ), (1) 



which is a convolution in the partonic momentum fractions, and is derived under the usual 
assumption pr S> m#. When pr <C ran occurs, as a result of soft and soft+collinear 
emission of gluons from the initial state, the theorem fails. The ratio of the two very 
different physical scales in the partonic cross section a^ , produces large logarithms of the 
form ln(m///pr), which are not absorbed by the parton distribution functions fj/ p , unlike 
the ones originating from purely collinear parton emission. These logs are enhanced by a 
1/Pt pre-factor at low pt- (The same factor suppresses them for large pr-) As a result, 
the Higgs pt distribution calculated using the conventional hadronic factorization theorem 
is un-physical in the low pt region. 

To resolve this problem, the differential cross section is split into a part which contains 
all the logarithmic terms {W), and into a regular term (Y): 

^ = W (m H ,p T )+Y(m H ,p T ), (2) 

Since Y does not contain logs of px, it can be calculated using the usual factorization. 
The W term has to be evaluated differently, keeping in mind that failure of the standard 
factorization occurs because it neglects the transverse motion of the incoming partons in the 



2 Here and henceforth, summation on double partonic indices (e.g. ji) is implied. Also, since we 
are focusing on the transverse momentum, longitudinal partonic momentum fractions are either 
kept implicit or, when applicable, integrated over. 
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hard scattering. As has been proven [11], W has a simple form in the Fourier conjugate, 
that is the transverse position (6) space 



W(m H , b) = C n/hl (m H , b) e -^ m »^ C h/h2 (m H , b), 



(3) 



with the Sudakov exponent defined as 



S(m H ,K) 



A(a s (n))\n 



m 



ii 



+ B{a s {ii)) 



(4) 



which resums the large logarithmic terms. f\ The partonic recoil against soft gluons, as 
well as the intrinsic partonic transverse momentum, are included in the generalized parton 
distributions 



Cj/h(m H , b, x) = Cj a (m H , K) <g> f a /h(m H ) (x) F a /h(m H , b, x), 



(5) 



where the convolution is evaluated over the partonic momentum fraction x. The A and B 
functions, and the Wilson coefficients Cj a are free of logs and safely calculable perturbatively 
as expansions in the strong coupling 



A(as) = jr(^-) n A^, etc. 

n=l V 71 J 



(6) 



The process independent non-perturbative functions J- a /hi describing long distance trans 



verse physics, are extracted from low-energy experiments [14]. 

The matching of the low and the high regions is achieved via the Y piece. To correct 
the behavior of the resummed piece in the intermediate and high regions, it is defined 
as the difference of the cross section calculated by the standard factorization formula at a 
fixed order n of perturbation theory and its pt tuh asymptote. [] The resummed cross 



section, to order otg, then reads as 



3 To prevent evaluation of the Sudakov exponent in the non-perturbative region, the impact pa- 



rameter b = \b\ is replaced by 6* = b/y/l + (6/6 max ) 2 . The choice of Co = 2e 7E , where 'Je is the 
Euler constant, is customary. 

4 The expression for the Y term for Higgs production can be found elsewhere [15|. 
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da „„ , daW da^ 



dp T ^ h,Pt + ^2 ^2 

At low pt, the logarithms are large and the asymptotic part dominates the fixed order pt 
distribution. The last two terms in Eq. (0) nearly cancel, and W is a good approximation to 
the cross section. At high p? the logarithms are small, and the expansion of the resummed 
term cancels the pr singular terms (up to higher orders in as), and the cross section reduces 
to the fixed order perturbative result. After matching the resummed and fixed order cross 
sections in such a manner, it is expected that the normalization of the resummed cross 
section reproduces the fixed order total rate, since when expanded and integrated over pt 
it deviates from the fixed order result only in higher order terms. For further details of the 
low pt factorization formalism and its application to Higgs production we refer to the recent 
literature [0,011 • 



(7) 



III. PARTON SHOWERING AND RESUMMATION 

For technical reasons, the initial state parton shower proceeds by a backwards evolution, 
starting at the large (negative) Q 2 scale of the hard scatter and then considering emissions 
at lower and lower (negative) virtualities, corresponding to earlier points on the cascade 
(and earlier points in time), until a scale corresponding to the factorization scale is reached. 
The transverse momentum of the initial state is built up from the whole series of splittings 
and boosts. The showering process is independent of the hard scattering process being 
considered (as long as one does not introduce any matrix element corrections), and depends 
only on the initial state partons and the hard scale of the process. 

Parton showering utilizes the fact that the leading order singularities of cross sections 
factorize in the collinear limit. This is expressed as 

lim |A^, +1 | 2 = ^P a ^ bg ( z )\M n \\ (8) 

PbWPg Pb-Pg 

where A4 n +i is the invariant amplitude for the process producing n partons and a gluon, as 
is the strong coupling constant, Pf, and p g are the 4-momenta of the daughters of parton a, 



and P a ^bg{z) is the DGLAP evolution kernel associated with the a — *> bg splitting. These 
leading order collinear singularities can be factorized into a Sudakov form factor 



shower \ 



which is interpreted as the probability V = exp(— iS shower ) of the partonic evolution from 
scale Qq to Q with no resolvable branchings. This probability can be used to determine the 
scale for the first emission and hence for the whole cascade. The formalism can be extended 
to soft singularities as well by using angular ordering. In this approach, the choice of the 
hard scattering is based on the use of evolved parton distributions, which means that the 
inclusive effects of initial-state radiation are already included. What remains is, therefore, 
to construct the exclusive showers. 

Parton showering resums primarily the leading logs which are universal, that is process 
independent and depend only on the given initial state. In this lies one of the strengths of 
Monte Carlos, since parton showering can be incorporated into a wide variety of physical 
processes. An analytic calculation, in comparison, can resum all logs. For example, the 
low pt factorization formalism sums all of the logarithms with run/pr i n their arguments. 
As discussed earlier, all of the 'dangerous logs' are included in the Sudakov exponent (f|). 
The A and B functions in Eq.(§]) contain an infinite number of coefficients, with the 
coefficients being universal, while the B^ n ^s are process dependent, with the exception of 
B^. In practice, the number of towers of logarithms included in the analytic Sudakov 
exponent depends on the level to which a fixed order calculation was performed for a given 
process. Generally, if a next-to-next-to-leading order calculation is available, then B^ can 
be extracted and incorporated. Extraction of higher coefficients require the knowledge of 
even higher order QCD corrections. So far, only the A^\ A^ and B^ coefficients are 
known for Higgs production but the calculation of B^ is in progress [17]. If we try to 



interpret parton showering in the same language then we can say that the Monte Carlo 
Sudakov exponent always contains a term analogous to A^ . It was shown in Ref . |TS[ that 



a term equivalent to B' 1 ' is also included in the (HERWIG) shower algorithm, and a suitable 
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modification of the Altarelli-Parisi splitting function, or equivalently the strong coupling 
constant as, also effectively approximates the coefficient. [] 

In contrast with the shower Monte Carlos, analytic resummation calculations integrate 
over the kinematics of the soft-gluon emission, with the result that they are limited in 
their predictive power to inclusive final states. While the Monte Carlo maintains an ex- 
act treatment of the branching kinematics, in the original low px factorization formalism no 
kinematic penalty is paid for the emission of the soft-gluons, although an approximate treat- 
ment of this can be incorporated into its numerical implementations, such as ResBos ||[13||. 
Neither the parton showering process nor the analytic resummation translate smoothly into 
kinematic configurations where one hard parton is emitted at large px- In the Monte Carlo 
matrix element corrections, and in the analytic resummation calculation matching, is neces- 
sary. This matching is standard procedure for resummed calculations, and matrix element 



corrections are becoming increasingly common in Monte Carlos ||19|-|2T 



With the appropriate input from higher order cross sections, a resummation calculation 
has the corresponding higher order normalization and scale dependence. The normalization 
and scale dependence for the Monte Carlo, though, remains that of a leading order calcu- 
lation. The parton showering process redistributes the events in phase space, but does not 
change the total cross section (for example, for the production of a Higgs boson). [] 

One quantity which is expected to be well described by both calculations is the transverse 
momentum of the final state electroweak boson in a subprocess such as qq — > W ± X, Z°X 
or gg — > HX, where most of the pt is provided by initial state parton emission. The parton 
showering supplies the same sort of transverse kick as the resummed soft-gluon emission in 



5 This is rigorously true only for the high x or \fr region. 

technically, one could add the branching for q — > g+Higgs in the shower, which would somewhat 
increase the Higgs cross section. However, the main contribution to the higher order .fT-factor 
comes from the virtual corrections and the 'Higgs bremsstrahlung' contribution is negligible. 
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the analytic calculation. Indeed, similar Sudakov form factors appear in both approaches. 
The correspondence between the Sudakov form factors of resummation and Monte Carlo 
approaches embodies many subtleties, relating to both the arguments of the Sudakov factors 
as well as the impact of sub-leading logs 

At a point in its evolution, typically corresponding to the virtuality of a few GeV 2 , 
the parton shower is cut off and the effects of gluon emission at softer scales must be 
parameterized and inserted by hand. This is similar to the somewhat arbitrary division 
between perturbative and non-perturbative regions in the resummation calculation. The 
parameterization is typically expressed in a Gaussian form, similar to that used for the 



non-perturbative k? in a resummation program ||14j| . In general, the value for the non- 
perturbative {kj) needed in a Monte Carlo program will depend on the particular kinematics 
and initial state being investigated. A value of the average non-perturbative of greater 
than 1 GeV, for example, does not imply that there is an anomalous intrinsic kx associated 
with the parton size. Rather, this amount of (kx) needs to be supplied to provide what is 
missing in the truncated parton shower. If the shower is cut off at a higher virtuality, more 
of the 'non-perturbative' kx will be needed. 

IV. Z° BOSON PRODUCTION AT THE TEVATRON 



From a theoretical viewpoint, Z° production at the Tevatron is one of the highest pre- 
cision testing grounds for the effects of multiple soft-gluon emission. The fully differential 
fixed order cross section has been calculated up to 0{a 2 s ), and the A^ 1,2 \ B^ 1,2 \ and C^ 1 ' re- 
summed coefficients are known for this process, and have been numerically implemented |13 



Since the 0(a 2 s ) corrections are relatively small (the order of a percent), the contribution of 
B^ is almost negligible. Thus, nominally the same perturbative physics is implemented in 
the shower Monte Carlos as in the resummation calculation. Any differences between their 
predictions can be ascribed to the small differences in the implementation of the perturba- 
tive physics, and to the different non-perturbative physics they contain. Experimentally, the 
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4- momentum of a Z° boson, and thus its pt, can be measured with great precision in the 
e + e~ decay mode. Resolution effects are relatively minor and are easily corrected. Thus, 
the Z° pt distribution is an excellent probe of the effects of the soft-gluon emission. 

The resolution corrected pr distribution (in the low px region) for Z° bosons from the 
CDF experiment |23| is shown in Figure [I|, compared to both the resummed prediction of 
ResBos, and to two predictions from PYTHIA version 6.125. One PYTHIA prediction uses 
the default value of intrinsic /c™ s = 0.44 GeV (dashed histogram)^], and the second a value 
of 2.15 GeV (solid histogram), per incoming parton. | The latter value was found to give 
the best agreement for PYTHIA with the data, and a similar conclusion has been reached 
m comparisons of the CDF Z° p T data with HERWIG @. All of the predictions use the 
CTEQ4M parton distributions [p5|| . The shift between the two PYTHIA predictions at low 
Pt is clearly evident. As might have been expected, the high p? region (above 10 GeV) is 
unaffected by the value of the non-perturbative kr- Much of the kr 'given' to the incoming 
partons at their lowest virtuality, Qq, is reduced at the hard scatter due to the number of 
gluon branchings preceding the collision. The emitted gluons carry off a sizable fraction of 



the original non-perturbative k? [26]. This point will be investigated in more detail later for 
the case of Higgs production. 

In the resummed calculation it has been shown that, in addition to the perturbative 
physics (Sudakov and Wilson coefficients, Cj a ), the choice of the non-perturbative param- 
eters affects the shape of the distribution in the lowest px region and the location of the 
peak [O]. In order to qualitatively compare this effect to the smearing applied in the Monte 



7 For a Gaussian distribution, /c™ s = 1.13(A;t). 

previous publication []l9) indicated the need for a substantially larger non-perturbative (&t)> 
of the order of 4 GeV, for the case of production at the Tevatron. The data used in the 
comparison, however, were not corrected for resolution smearing, a fairly large effect for the case 
of W — > ev production and decay. 



11 



> 

CD 

O 



X) 

b 



30 
25 



CDF Preliminary 



PP 



-> 



7/Z + X 
7*/Z -> ee 

66 < M 7 / z < 116 GeV/c 2 





NNLO ResBos (CTEQ4M) 
rescaled up +8.4% 



5 10 15 

P t> GeV/c 



20 



FIG. 1. The Z° px distribution (at low pr) from CDF for Run 1 compared to predictions from 
ResBos (curve) and from PYTHIA (histograms). The two PYTHIA predictions use the default (rms) 
value for the non-perturbative (0.44 GeV) and the value that gives the best agreement with the 
shape of the data (2.15 GeV). The normalization of the resummed prediction was rescaled upwards by 
8.4%. The PYTHIA prediction was rescaled by a factor of 1.4 for the shape comparison. (Including only 
soft-gluon QCD corrections, PYTHIA does not contain the QCD K-factor.) 

Carlos, it is possible to bring the resummation formula to a form where the non-perturbative 
function acts as a Gaussian type smearing term. This, using the Ladinsky-Yuan parameteri- 
zation |27|] of the non-perturbative function, leads to an rms value of 2.5 GeV for the effective 
k>T smearing parameter, for Z° production at the Tevatron. This is in agreement with the 
PYTHIA and HERWIG results: to well describe the Z° production data at the Tevatron, 2-2.5 
GeV non-perturbative kx is needed in these implementations. The resummed curve agrees 
with the shape of the data well, which is a non-trivial result, since the resummation cal- 
culation does not contain any free parameters which are fitted to the Z° distribution. 
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FIG. 2. The Z° pt distribution (for the full range of pr) from CDF for Run 1 compared to 
predictions from ResBos (curve) and from PYTHIA (histogram). The normalization of the resummed 
prediction was rescaled upwards by 8.4%. The PYTHIA prediction was rescaled by a factor of 1.4 for 
the shape comparison. 

Even with the optimal non-perturbative k r ™ = 2.15 GeV, there is slight shape difference 
between the shower Monte Carlo and the data. This might be partially due to the lack of 
the B^ coefficient in the shower Monte Carlos. This is supported by the fact that, if the 
coefficient was not included in the resummed prediction, the result would be an increase 
in the height of the peak and a decrease in the rate between 10 and 20 GeV, leading to a 



better agreement with the best PYTHIA prediction JT2| . 

The Z° pt distribution is shown over a wider pt range in Figure fj. The PYTHIA and 
ResBos predictions both describe the data well. Note especially the agreement of PYTHIA 
with the data at high px, made possible by explicit matrix element corrections (from the 
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subprocesses qq — > Z°g and gq — > Z°q) to the Z° production process. f\ 



V. DIPHOTON PRODUCTION 

Most of the experience that we have for comparisons of data to resummation calculations 
or Monte Carlos is based on Drell-Yan pair production, that is mostly on qq initial states. It 
is important then to examine diphoton production at the Tevatron, where a large fraction 
of the contribution at low mass is due to gg scattering. The prediction for the diphoton p? 
distribution at the Tevatron, from PYTHIA (version 6.122), is shown in Figure 0, using the 
experimental cuts applied in the CDF analysis p8| . About half of the diphoton cross section 



at the Tevatron is due to the gg subprocess, and the diphoton pt distribution is noticeably 
broader for the gg subprocess than for the qq subprocess. 

A comparison of the px distributions for the two diphoton subprocesses (qq, gg) in two 
recent versions of PYTHIA, 5.7 and 6.1, is shown in Figure [|. There seems to be little difference 
in the pt distributions between the two versions for both subprocesses. As will be shown 
later, this is not true for the case of Higgs production. In Figure |5] are shown the ResBos 
predictions for diphoton production at the Tevatron from qq and gg scattering compared to 
the PYTHIA predictions. The gg subprocess predictions in ResBos agree well with those from 
PYTHIA while the qq px distribution is noticeably broader in ResBos. The latter behavior 



Slightly different techniques are used for the matrix element corr ections by PYTHIA @ and 
by HERWIG |20|| . In PYTHIA, the parton shower probability distribution is applied over the whole 
phase space and the exact matrix element corrections are applied only to the branching closest 
to the hard scatter. In HERWIG, the corrections are generated separately for the regions of phase 
space unpopulated by HERWIG (the 'dead zone') and the populated region. In the dead zone, the 
radiation is generated according to a distribution using the first order matrix element calculation, 
while the algorithm for the already populated region applies matrix element corrections whenever 
a branching is capable of being 'the hardest so far'. 
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FIG. 3. A comparison of the PYTHIA predictions for diphoton production at the Tevatron for the 
two different subprocesses, qq and gg. The same cuts are applied to PYTHIA as in the CDF diphoton 
analysis. 

is due to the presence of the Y piece in ResBos at moderate pt, that is the matching of 
the qq cross section to the fixed order qq — > 77^ at high p?. The corresponding matrix 
element correction is not implemented in PYTHIA. The PYTHIA and ResBos predictions for 
99 ~ * 77 agree in the moderate pt region, even though the ResBos prediction has the Y 
piece present and is matched to the matrix element piece gg — > 775 at high p?, while there 
is no such matrix element correction for PYTHIA. This demonstrates the smallness of the Y 
piece for the gg subprocess, which is the same conclusion that was reached in Ref. 0. One 
way to understand this is recalling that the gg parton-parton luminosity falls very steeply 
with increasing partonic center of mass energy, This falloff tends to suppress the size of 
the Y piece since the production of the diphoton pair at higher p? requires larger values of 
longitudinal momentum fractions. 
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FIG. 4. A comparison of the PYTHIA predictions for diphoton production at the Tevatron for the 
two different subprocesses, gg (top) and qq (bottom), for two recent versions of PYTHIA. The same 
cuts are applied to PYTHIA as in the CDF diphoton analysis. 



Comparisons of the diphoton data measured by both the CDF [25] and DO [29] experi- 
ments indicate a disagreement of the observed diphoton px distribution with the NLO QCD 
predictions fJU|. In particular, the pt distribution in the data is noticeably broader than 
that predicted by fixed order QCD calculations, but in agreement with the predictions of 
ResBos The transverse distributions of the diphoton pair are particularly sensitive to 
the effects of soft-gluon radiation. The pt distribution, for example, is a delta function 
calculated at leading order, and is strongly smeared by the all order Sudakov factor. Given 
the small size of the diphoton cross section at the Tevatron, the comparisons for Run 1 
are statistically limited. A more precise comparison between theory and experiment will be 
possible with the 2 fb^ 1 or greater data sample that is expected for CDF and DO in Run 
2, and at the LHC. The Monte Carlo prediction for the diphoton production cross section, 
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FIG. 5. A comparison of the PYTHIA and ResBos predictions for diphoton production at the 
Tevatron for the two different subprocesses, gg (left) and qq (right). The same cuts are applied to 
PYTHIA and ResBos as in the CDF diphoton analysis. The bottom figures show the same in logarithmic 
scale. 

as a function of the diphoton and using cuts appropriate to ATLAS and CMS, is shown 
in Figure [| As at the Tevatron, about half of the cross section is due to gg scattering and 
the diphoton p T distribution from gg scattering is noticeably broader than that from qq 
production. 

In Figure [7| is shown a comparison of the diphoton distribution at the LHC for two 
different versions of PYTHIA, for the two different subprocesses. Note that the pt distribution 
in PYTHIA version 5.7 is somewhat broader than that in version 6.122 for the case of gg 
scattering. The effective diphoton mass range being considered here is lower than the 150 
GeV Higgs mass that will be considered in the next section. As will be seen, the differences in 
soft-gluon emission between the two versions of PYTHIA are larger in that case. In Figure 
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FIG. 6. A comparison of the PYTHIA predictions for diphoton production at the LHC for the two 
different subprocesses, qq and gg. Similar cuts are applied to the diphoton kinematics as those used by 
ATLAS and CMS. 

are shown the ResBos predictions for diphoton production at the LHC from qq and gg 
scattering compared to the PYTHIA predictions. Again, the gg subprocess in ResBos agree 
well with PYTHIA, while the qq pt distribution is noticeably broader in ResBos, for the 
reasons cited previously. 



A comparison of the SM Higgs pt distribution at the LHC, for a Higgs mass of 150 GeV, 
is shown in Figure |], for ResBos and the two recent versions of PYTHIA. As before, PYTHIA 
has been rescaled by a factor of 1.7, to agree with the normalization of ResBos to allow for 
a better shape comparison. There are a number of features of interest. First, the peak of 
the resummed distribution has moved to pr ~ 11 GeV (compared to about 3 GeV for Z Q 



VI. HIGGS BOSON PRODUCTION 
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FIG. 7. A comparison of the PYTHIA predictions for diphoton production at the LHC for the two 
different subprocesses, gg (top) and qq (bottom), for two recent versions of PYTHIA. Similar cuts are 
applied to the diphoton kinematics as are used by ATLAS and CMS. 

production at the Tevatron). This is partially due to the larger mass (150 GeV compared 
to 90 GeV), but is primarily because of the larger color factors associated with initial state 
gluons {Ca = 3) rather than quarks (CV = 4/3), and also because of the larger phase 
space for initial state gluon emission at the LHC. Second, and more importantly, there is 
a substantial disagreement for the shape of the Higgs pr distribution between ResBos and 
PYTHIA 5.7, and between the two versions of PYTHIA. An understanding of the reasons for 
these differences is critical, as the shape of the transverse momentum distribution for the 
Higgs in the low to moderate pt region, can dictate the details of both the experimental 
triggering and the analysis strategies for the Higgs search. As noted before, most of the 
studies for Higgs production by CMS and ATLAS have been based on PYTHIA 5.7. For 
the CMS detector, the higher pt activity associated with Higgs production in version 5.7 
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FIG. 8. A comparison of the PYTHIA and ResBos predictions for diphoton production at the LHC 
for the two different subprocesses, gg (left) and qq (right). Similar cuts are applied to PYTHIA and 
ResBos as in the ATLAS and CMS diphoton analyses. The bottom figures show the same in logarithmic 
scale. 

allows for a more precise determination of the event vertex from which the Higgs (decaying 
into two photons) originates. Vertex pointing with the photons is not possible in the CMS 
barrel, and the large number of interactions occurring with high intensity running will mean 
a substantial probability that at least one of the interactions will have more activity than 
the Higgs vertex, thus leading to the assignment of the Higgs decay to the wrong vertex, 
and therefore a noticeable degradation of the 77 effective mass resolution. 

In comparison to ResBos, the older version of PYTHIA produces too many Higgs events 
at moderate pt- Two changes have been implemented in the newer version 6.1. The first 
change is that a cut is placed on the combination of z (longitudinal momentum fraction) 
and Q 2 (partonic virtuality) values in a branching: u = Q 2 — s(l — z) < 0, where s refers to 
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FIG. 9. A comparison of predictions for the Higgs pr distribution at the LHC from ResBos and 
from two recent versions of PYTHIA. The ResBos and PYTHIA predictions have been normalized to the 
same area. The bottom figure shows the same in logarithmic scale. 

the squared invariant mass of the subsystem of the hard scattering plus the shower partons 
considered to that point. The association with u is relevant if the branching is interpreted 
in terms of a 2 — > 2 hard scattering. The corner of emissions that do not respect this 
requirement occurs when the Q 2 value of the space-like emitting parton is little changed and 
the z value of the branching is close to unity. This effect is mainly for the hardest emission 
(largest Q 2 ). The net result of this requirement is a substantial reduction in the total amount 
of gluon radiation |32| . In the second change, the parameter for the minimum gluon energy 



°Such branchings are kinematically allowed, but since matrix element corrections would assume 
initial state partons to have Q 2 = 0, a non-physical u results (and thus no possibility to impose 
matrix element corrections). The correct behavior is beyond the predictive power of leading log 
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emitted in space-like showers is modified by an extra factor roughly corresponding to the I/7 



factor for the boost to the hard subprocess frame [32]. The effect of this change is to increase 
the amount of gluon radiation. Thus, the two effects are in opposite directions but with 
the first effect being dominant. In principle, this problem could affect the pt distribution 
for all PYTHIA processes. In practice, it affects only gg initial states, due to the enhanced 
probability for branching in such an initial state. 

The newer version of PYTHIA agrees well with ResBos at low to moderate px, but falls 
below the resummed prediction at high p?- The agreement of the predictions of PYTHIA 
6.1 with those of ResBos, in the low to moderate pt region, gives some credence that the 
changes made in PYTHIA move in the right direction. The disagreement at high p? can be 
easily understood: ResBos switches to the NLO Higgs+jet matrix element at high p?, while 
the default PYTHIA can generate the Higgs pt distribution only by initial state gluon radi- 
ation, using the Higgs mass squared as maximum virtuality. High px Higgs production is 
another example where a 2 — > 1 Monte Carlo calculation with parton showering cannot com- 
pletely reproduce the exact matrix element calculation, without the use of matrix element 
corrections. The high p T region is better reproduced if the maximum virtuality Q 2 max is set 
equal to the squared partonic center of mass energy, s, rather than m 2 H . This is equivalent 
to applying the parton shower to all of phase space. However, this has the consequence of 
depleting the low pt region, as 'too much' showering causes events to migrate out of the 
peak. The appropriate scale to use in PYTHIA (or any Monte Carlo) depends on the pt range 
to be probed. If matrix element information is used to constrain the behavior, the correct 
high pt cross section can be obtained while still using the lower scale for showering. Thus, 
the incorporation of matrix element corrections to Higgs production (involving the processes 
gq — ► qH,qq — > gH, gg — ► gH) is the next logical project for the Monte Carlo experts, in 



order to accurately describe the high p? region, and is already in progress [2c, 



Monte Carlos. 
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FIG. 10. A comparison of predictions for the Higgs pr distribution at the Tevatron from ResBos 
and from two recent versions of PYTHIA. The ResBos and PYTHIA predictions have been normalized to 
the same area. The bottom figure shows the same in logarithmic scale. 

A comparison of the two versions of PYTHIA and of ResBos is also shown in Figure |1(] 
for the case of Higgs production at the Tevatron with center-of-mass energy of 2.0 TeV (for 
a hypothetical SM Higgs mass of 100 GeV) . R The same qualitative features are observed 
as at the LHC: the newer version of PYTHIA agrees better with ResBos in describing the low 
Pt shape, and there is a falloff at high unless the larger virtuality is used for the parton 
showers. The default (rms) value of the non-perturbative kx (0.44 GeV) was used for the 



ii 



As an exercise, events for an 80 GeV W boson and an 80 GeV Higgs were generated at the 



Tevatron using PYTHIA 5.7 [34 1. A comparison of the distribution of values of u and the virtuality Q 
for the two processes indicates a greater tendency for the Higgs virtuality to be near the maximum 
value and for there to be a larger number of Higgs events with positive u. 
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FIG. 11. A comparison of predictions for the Higgs pt distribution at the LHC from ResBos, a 
recent version of PYTHIA, and HERWIG. All predictions have been normalized to the same area. The 
bottom figure shows the same in logarithmic scale. 

PYTHIA predictions for Higgs production. 

VII. COMPARISON WITH HERWIG 

The variation between versions 5.7 and 6.1 of PYTHIA gives an indication of the uncer- 
tainties due to the types of choices that can be made in Monte Carlos. The prescription that 
u be negative for all branchings is a choice rather than an absolute requirement. Perhaps 
the better agreement of version 6.1 with ResBos is an indication that the adoption of the u 
restrictions was correct. Of course, there may be other changes to PYTHIA which would also 
lead to better agreement with ResBos for this variable. 

Since there are a variety of choices that can be made in Monte Carlo implementations, it 
is instructive to compare the predictions for the distribution for Higgs production from 
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ResBos and PYTHIA with that from HERWIG version 5.6. The HERWIG prediction, for the 
Higgs pt distribution at the LHC, is shown in Figure [11], along with the PYTHIA and ResBos 
predictions, all normalized to the ResBos prediction. Q (In all cases, the CTEQ4M parton 
distribution was used.) The predictions from HERWIG and PYTHIA 6.1 are very similar, with 
the HERWIG prediction matching the ResBos shape somewhat better at low p?- It is inter- 
esting that HERWIG matches the ResBos prediction so closely without the implementation 
of any kinematic cuts as in PYTHIA 6.1. Perhaps the reason is related to the treatment of 
color coherence in the HERWIG parton showering algorithm. For reference, the absolutely 
normalized predictions from ResBos, PYTHIA, and HERWIG for the Pt distribution of a 150 



GeV Higgs at the LHC are shown in Figure 12 



VIII. NON-PERTURBATIVE K T 

A question still remains as to the appropriate input value of non-perturbative k? in the 
Monte Carlos to achieve a better agreement in shape, both at the Tevatron and at the 
LHC. In Figures [13] and [14] are shown comparisons of ResBos and PYTHIA predictions for 



the Higgs pt distribution at the Tevatron and the LHC. The PYTHIA prediction (version 6.1 
alone) is shown with several values of non-perturbative h?. Surprisingly, no difference is 
observed between the predictions with the different values of fey, with the peak in PYTHIA 
always being somewhat below that of ResBos. This insensitivity can be understood from 
the plots at the bottom of the two figures, which show the sum of the non-perturbative 
partonic initial state fcr's (fcyi+fcy2) at Qo and at the hard scatter scale Q. Most of the kr 
is radiated away, with this effect being larger (as expected) at the LHC. The large gluon 
radiation probability from a gg initial state and the greater phase space available at the 



12 The normalization factors (ResBos/Monte Carlo) are 1.68 for both versions of PYTHIA, and 1.84 
for HERWIG. 

13 This has also been explored for direct photon production in Ref. p5|]. 
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FIG. 12. A comparison of predictions for the Higgs px distribution at the LHC from ResBos, two 
recent versions of PYTHIA, and HERWIG. All predictions have their absolute normalizations. The bottom 
figure shows the same in logarithmic scale. 

LHC lead to a stronger degradation of the non-perturbative kx than was observed with Z° 
production at the Tevatron. 

For completeness, a comparison of PYTHIA and ResBos is shown in Figure ITB for Z° boson 



production at the LHC. There are two points that are somewhat surprising. First, there is 
still a very strong sensitivity to the value of the non-perturbative kx used in the smearing. 
Second, the best agreement with ResBos is obtained with the default value (0.44 GeV), in 
contrast to the 2.15 GeV needed at the Tevatron (cf. Figjl]) . Note again the agreement 
of PYTHIA with ResBos at the highest values of Z° px due to the explicit matrix element 
corrections applied. The sum of the incoming parton kx distributions, both at the scale 
Qo and at the hard scattering scale, are shown in Figure [16| for several different starting 
(rms) values of primordial kx (per parton). There is substantially less radiation for a qq 
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FIG. 13. (top) A comparison of the PYTHIA predictions for the distribution of a 100 GeV Higgs 
at the Tevatron using the default (rms) non-perturbative kx (0.44 GeV) and a larger value (4 GeV), at 
the initial scale Qq and at the hard scatter scale. Also shown is the ResBos prediction, (bottom) The 
vector sum of the intrinsic krp (kxi+ &T2) for the two initial state partons at the initial scale Qq and 
at the hard scattering scale, for the two values of primordial kx- 

initial state than for a gg initial state (as in the case of the Higgs), leading to a noticeable 
dependence of the Z° pt distribution on the primordial kx distribution. 



IX. CONCLUSIONS 

An understanding of the signature for Higgs boson production at either the Tevatron or 
the LHC depends upon the understanding of the details of the multiple soft-gluon emission 
from the initial state partons. This soft-gluon radiation can be modeled either in a Monte 
Carlo or by an analytic resummation calculation, with various choices possible in both 
implementations. A comparison of the two approaches helps in understanding their strengths 
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FIG. 14. (top) A comparison of the PYTHIA predictions for the px distribution of a 150 GeV Higgs 
at the LHC using the default (rms) non-perturbative kr (0.44 GeV) and a larger value (4 GeV), at 
the initial scale Qq and at the hard scatter scale. Also shown is the ResBos prediction, (bottom) The 
vector sum of the intrinsic kr (^ti+^T2) for the two initial state partons at the initial scale Qq and 
at the hard scattering scale, for the two values of primordial kx- 

and weaknesses, and their reliability. The data from the Tevatron that either exist now, or 
will exist in Run 2, and from the LHC will be extremely useful to test both methods. 
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FIG. 15. A comparison of the predictions for the px distribution for Z° production at the LHC 
from PYTHIA and ResBos, where several values of kr have been used to make the PYTHIA predictions. 

At the final stage of preparing this manuscript, we became aware of a new preprint ||36|| , 
which studied the A and B functions for Higgs production up to 0(ag), and presented an 
expression for the coefficient B^ 2 \ 
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